An Approximate Duplicate-Elimination in RFID Data Streams Based on d-Left Time Bloom Filter
نویسندگان
چکیده
Article history: Received 6 March 2010 Received in revised form 16 July 2011 Accepted 18 July 2011 Available online 31 July 2011 The RFID technology has been applied to a wide range of areas since it does not require contact in detecting RFID tags. However, due to the multiple readings in many cases in detecting an RFID tag and the deployment of multiple readers, RFID data contains many duplicates. Since RFID data is generated in a streaming fashion, it is difficult to remove duplicates in one pass with limitedmemory.We propose one pass approximate methods based on Bloom Filters using a small amount of memory. We first devise Time Bloom Filters as a simple extension to Bloom Filters. We then propose Time Interval Bloom Filters to reduce errors. Time Interval Bloom Filters need more space than Time Bloom Filters. We propose a method to reduce space for Time Interval Bloom Filters. Since Time Bloom Filters and Time Interval Bloom Filters are based on Bloom Filters, they do not produce false negative errors. Experimental results show that our approaches can effectively remove duplicates in RFID data streams in one pass with a small amount of memory. © 2011 Elsevier B.V. All rights reserved.
منابع مشابه
Advanced Bloom Filter Based Algorithms for Efficient Approximate Data De-Duplication in Streams
Data intensive applications and computing has emerged as a central area of modern research with the explosion of data stored world-wide. Applications involving telecommunication call data records, web pages, online transactions, medical records, stock markets, climate warning systems, etc., necessitate efficient management and processing of such massively exponential amount of data from diverse...
متن کاملStreaming Quotient Filter: A Near Optimal Approximate Duplicate Detection Approach for Data Streams
The unparalleled growth and popularity of the Internet coupled with the advent of diverse modern applications such as search engines, on-line transactions, climate warning systems, etc., has catered to an unprecedented expanse in the volume of data stored world-wide. Efficient storage, management, and processing of such massively exponential amount of data has emerged as a central theme of rese...
متن کاملISSN 1330-3651 (Print), ISSN 1848-6339 (Online)
Original scientific paper Recently, RFID technology has been widely used in many applications such as object monitoring and tracing due to the unique features such as noncontact, automatic, fast and multi-target identification simultaneously. However, because of the interference of environmental factors and the requirement of real-time detection, the data collected by the RFID readers are often...
متن کاملAn Overview on Various Rfid Data Filtering Techniques Based on Bloom Filter Approach
The unreliability data reading such as noise, missed reading and duplicate reading that were produced by RFID reader has become the primary factor limiting the widespread adoption of RFID technology. It is compulsory to filter the raw data to maintain its reliability of data reading because a slight decreasing of effective read rate will reduce the accuracy and reliability of RFID. One of the a...
متن کاملEfficiently Filtering RFID Data Streams
RFID holds the promise of real-time identifying, locating, tracking and monitoring physical objects without line of sight, and can be used for a wide range of pervasive computing applications. To achieve these goals, RFID data has to be collected, filtered, and transformed into semantic application data. RFID data, however, contains false readings and duplicates. Such data cannot be used direct...
متن کامل